Verify definitions #116

tirix · 2023-06-08T08:22:17Z

This is an in-progress version of a pull request adding the definition soundness checks.
Not all checks are implemented yet, and errors are not handled (the function panics if anything goes wrong).

The intent is to follow the algorithm provided by @digama0 in #103 to identify definitions.

Based on the identification of definitions, this also adds a function to export the dependencies of definitions in the GraphML file format. Redundant dependencies are not checked (for example, df-ipf includes two conjunctions, so the edge between df-ipf and df-an is doubled).
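To illustrate the doubled-edge remark, here is a minimal sketch of a GraphML writer that collapses duplicate edges before emitting them. It is not the code from this PR: the function name `to_graphml` and the `(definition, dependency)` pair representation are assumptions made for the example. It relies on Metamath labels containing no characters that need XML escaping.

```rust
use std::collections::BTreeSet;
use std::fmt::Write;

/// Render a definition dependency graph as GraphML, collapsing duplicate edges.
/// `edges` holds hypothetical (definition, dependency) label pairs.
pub fn to_graphml(edges: &[(&str, &str)]) -> String {
    // A BTreeSet drops repeated pairs and gives a stable output order.
    let unique: BTreeSet<(&str, &str)> = edges.iter().copied().collect();
    // Every label that appears at either end of an edge becomes a node.
    let nodes: BTreeSet<&str> = unique.iter().flat_map(|&(a, b)| [a, b]).collect();

    let mut out = String::new();
    out.push_str("<?xml version=\"1.0\" encoding=\"UTF-8\"?>\n");
    out.push_str("<graphml xmlns=\"http://graphml.graphdrawing.org/xmlns\">\n");
    out.push_str("  <graph id=\"definitions\" edgedefault=\"directed\">\n");
    for n in &nodes {
        writeln!(out, "    <node id=\"{}\"/>", n).unwrap();
    }
    for (src, dst) in &unique {
        writeln!(out, "    <edge source=\"{}\" target=\"{}\"/>", src, dst).unwrap();
    }
    out.push_str("  </graph>\n</graphml>\n");
    out
}
```

Passing the pair `("df-ipf", "df-an")` twice yields a single `<edge>` element, which is the behaviour the current export does not yet have.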

Currently this algorithm fails for these reasons:

the axiom ax-hilex is introduced while several definitions are still pending (e.g. csh is declared before it, but df-sh comes after), which breaks the assumption that definitions immediately follow their syntax axioms;
the axiom ax-riotaBAD is a redefinition of crio, whereas there should be only one definition for a given definiendum;
the syntax axiom cmesy for mESyn does not have a definition.

tirix · 2023-06-08T08:50:02Z

@david-a-wheeler Sorry, this kind of overrules your #103, but there seemed to be some kind of urgency in introducing these changes now.

david-a-wheeler · 2023-06-08T13:45:12Z

@tirix - no problem! I started writing definitional soundness code because I wanted the functionality, but I always seem to "run out of time" to complete it. Thanks for picking up the effort, I'm delighted to see it.

tirix · 2023-06-08T23:21:00Z

@digama0 Could you please confirm that this corresponds to what you were describing?

tirix

Thanks for all the improvements!
I only have one question.

src/defck.rs

digama0 · 2023-06-12T05:54:42Z

I pushed some more modifications. The current error reports are basically all true positives now, and it might help to start working on them in parallel with the checker itself. Summary of issues:

The line $( $j primitive 'weq' 'wel'; $) marks non-primitives as primitive, which is somewhere between superfluous and an error
chba $a class ~H $. is not marked as primitive (NM)
cmgfs $a class mGFS $. is missing a definition (@digama0)
cmesy $a class mESyn $. is missing a definition (@digama0)
cqpOLD $a class QpOLD $. is missing a definition (@digama0)
cprvb $a wff Prv ph $. is missing a primitive declaration (@benjub)
wcel-wl $a wff x wl-el B $. and wcel2-wl $a wff x wl-el2 B $. are missing a primitive declaration (@wlammen)
wvhc4 .. wvhc12 are missing definitions (AS)

src/defck.rs

src/diag.rs

tirix · 2023-06-12T09:50:51Z

src/diag.rs

+            DefCkSyntaxUsedBeforeDefinition(tok, saddr) => (format!("Definition Check: '{label}' used before definition", label = t(tok)).into(), vec![(
+                AnnotationType::Error,
+                format!("this expression contains an occurrence of '{label}'", label = t(tok)).into(),
+                stmt,
+                stmt.span(),
+            ), (
+                AnnotationType::Note,
+                "syntax declared here".into(),
+                sset.statement(*saddr),
+                sset.statement(*saddr).label_span(),
+            )]),


It would be nice if we could point to the definition if there is one.

src/diag.rs

digama0 · 2023-06-12T23:32:09Z

src/defck.rs

+                let free_dummies = free_dummies
+                    .into_iter()
+                    .map(|atom| {
+                        let sref = self.db.statement_by_label(atom).unwrap();
+                        sref.math_at(1).slice.into()
+                    })
+                    .sorted()
+                    .collect();


One of the things I don't like about the current Formula interface is that variables are represented as $f theorem labels, which makes it difficult to interact with Frame for e.g. DV condition checking (which uses variable indexes and $v Atoms for representing variables); you have to go through this kind of rigamarole to turn those $f labels into variable names.

@tirix There seems to be a bug in the grammar parsing, related to this: the statement parse for df-tru contains vx nodes even though vx is not in scope and has not been defined yet. The actual $f in the context is vx.tru.

Concerning df-tru, here is how the grammar parsing works:
The grammar module's FormulaTokenIterator gets the atom for each token using nameck's lookup_symbol:

if let Some(l) = self.nset.lookup_symbol(t.as_bytes()) { ... }

It later uses lookup_float to get the float if the token happens to be a variable. In none of these calls can grammar specify a frame context, and nameck actually only stores the topmost statement for a given token.
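The effect described above can be reproduced with a toy model. This is only a sketch, not the nameck or grammar code: the `Float` struct, its `depth` and `scope` fields, and both lookup functions are hypothetical, and "topmost" is assumed here to mean the `$f` declared at the outermost nesting level.

```rust
use std::collections::HashMap;
use std::ops::Range;

/// A `$f` statement: its label, the variable it types, its nesting depth,
/// and the (hypothetical) range of statement indexes where it is in scope.
#[derive(Debug, Clone, PartialEq)]
pub struct Float {
    pub label: &'static str,
    pub var: &'static str,
    pub depth: usize,
    pub scope: Range<usize>,
}

/// Context-free lookup: one `$f` per variable, keeping the outermost one.
pub fn global_index(floats: &[Float]) -> HashMap<&'static str, &'static str> {
    let mut best: HashMap<&'static str, &Float> = HashMap::new();
    for f in floats {
        let keep_existing = best.get(f.var).map_or(false, |prev| prev.depth <= f.depth);
        if !keep_existing {
            best.insert(f.var, f);
        }
    }
    best.into_iter().map(|(var, f)| (var, f.label)).collect()
}

/// Context-aware lookup: the `$f` for `var` that is in scope at statement `at`.
pub fn scoped_lookup(floats: &[Float], var: &str, at: usize) -> Option<&'static str> {
    floats
        .iter()
        .find(|f| f.var == var && f.scope.contains(&at))
        .map(|f| f.label)
}
```

With a local `vx.tru` in scope for statements 10..20 and a global `vx` declared later, the context-free index answers `vx` for `x` everywhere, while the scoped lookup at statement 15 answers `vx.tru`.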

digama0 · 2023-06-12T23:35:54Z

We have several applications relying on metamath-knife, and it's always better if they can point to something stable interface-wise.

Note that the API for definition checking changes in every commit, I wouldn't commit to anything until it's feature-complete. And it has some effects on the Formula API too, since this is the first kind of real theorem-prover-ish thing we are trying to do with metamath-knife inside the main repo.

digama0 · 2023-06-12T23:53:04Z

If it implements only some of the rules, but you can invoke the code to check those rules, I suggest merging it.

That will give us a more-stable base to build on.

For clarity perhaps we should document "incomplete implementation" in the help info, to make it clear that it only implements some of the checks. That way no one will accidentally depend on it being the full implementation.

When it comes to writing a verifier, I don't like releasing it half-baked, when only some of the checks required for soundness are implemented. There is a very concrete end point for the implementation. If we want to release something incomplete, then it should err on the side of more false positives, but we already know that this will cause thousands of errors in set.mm, so it won't be so helpful.

digama0 · 2023-06-13T04:03:54Z

Things are almost done now. All the DV checks are implemented, as well as the axiom / def naming checker, which revealed some additional issues that have been pushed to metamath/set.mm#3247 (in particular I put some things as primitive that were actually defined but where the definition was far away from the syntax and other disallowed stuff was in between). There are now three (expected) errors:

warning: Axioms should start with 'ax-'
     --> ../mm/set.mm:24406:3
      |
24406 |   df-clab $a |- ( x e. { y | ph } <-> [ x / y ] ph ) $.
      |   ------- This was identified as an axiom, but it doesn't start with 'ax-'
      |
warning: Axioms should start with 'ax-'
     --> ../mm/set.mm:24478:5
      |
24478 |     df-cleq $a |- ( A = B <-> A. x ( x e. A <-> x e. B ) ) $.
      |     ------- This was identified as an axiom, but it doesn't start with 'ax-'
      |
warning: Axioms should start with 'ax-'
     --> ../mm/set.mm:24547:5
      |
24547 |     df-clel $a |- ( A e. B <-> E. x ( x = A /\ x e. B ) ) $.
      |     ------- This was identified as an axiom, but it doesn't start with 'ax-'
      |

which is basically what I've been saying all along. Options now are to add some kind of override, or change the names.
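For readers who have not looked at the checker, the naming rule behind these warnings can be sketched as follows. The enum, the function name, and the wording of the `df-` message are assumptions made for illustration; only the `ax-` message text is taken from the output above.

```rust
/// How a `$a` statement was classified by a (hypothetical) earlier pass.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum AssertionKind {
    Axiom,
    Definition,
}

/// Return a warning when the label prefix disagrees with the classification.
pub fn naming_warning(label: &str, kind: AssertionKind) -> Option<String> {
    match kind {
        AssertionKind::Axiom if !label.starts_with("ax-") => Some(format!(
            "Axioms should start with 'ax-': '{}' was identified as an axiom",
            label
        )),
        // The message for definitions is an assumed counterpart of the one above.
        AssertionKind::Definition if !label.starts_with("df-") => Some(format!(
            "Definitions should start with 'df-': '{}' was identified as a definition",
            label
        )),
        _ => None,
    }
}
```

Under this sketch, `naming_warning("df-clab", AssertionKind::Axiom)` produces a warning, while a statement named `ax-mp` and classified as an axiom produces none.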

tirix · 2023-06-13T07:42:45Z

Options now are to add some kind of override, or change the names.

This deserves a wider discussion.
I think Norm wanted to keep the ax- prefix for actual axioms of set theory, so I prefer the former.

At least df-cleq can clearly be linked with the wceq syntax, and df-clel with wcel, why do they need to be declared primitives?
For df-clab, obviously it's a special case.

Interestingly a definition command appears in set.mm, even though the metamath-knife def checker does not use it:

  $( Register '<->' as an equality for its type (wff). $)
  $( $j
    equality 'wb' from 'biid' 'bicomi' 'bitri';
    definition 'dfbi1' for 'wb';
  $)

I did not find any trace of it in mmj2. Did I miss something?

tirix · 2023-06-13T08:09:32Z

Interestingly a definition command appears in set.mm

Actually this is defined in j_syntax.html:

$j definition 'DEFTHM' for 'SYNTAX';

Declare a theorem to be a definition. DEFTHM is a $p statement which represents an alternative definition for the syntax represented by the $a statement SYNTAX. The definition should have a top level equality declared by the equality command with the definition on the right hand side. (This command is only needed when the definition does not immediately follow the syntax itself, which in set.mm only occurs for df-bi. For most definitions we can automatically infer the requisite structure.)
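As a rough illustration of the command's shape, the sketch below parses a single `definition 'DEFTHM' for 'SYNTAX';` command into its two labels. It is not how metamath-knife parses `$j` comments: the function names are hypothetical, and it assumes single-quoted labels with no embedded whitespace, which is narrower than the full `$j` syntax.

```rust
/// Parse `definition 'DEFTHM' for 'SYNTAX';` into (DEFTHM, SYNTAX).
/// Returns None for any other command or for malformed input.
pub fn parse_definition_command(cmd: &str) -> Option<(String, String)> {
    // Commands end with a semicolon; drop it before splitting into words.
    let body = cmd.trim().strip_suffix(';')?;
    let mut words = body.split_whitespace();
    if words.next()? != "definition" {
        return None;
    }
    let defthm = unquote(words.next()?)?;
    if words.next()? != "for" {
        return None;
    }
    let syntax = unquote(words.next()?)?;
    // Anything after the second label makes the command malformed.
    if words.next().is_some() {
        return None;
    }
    Some((defthm.to_string(), syntax.to_string()))
}

/// Strip one pair of surrounding single quotes.
fn unquote(word: &str) -> Option<&str> {
    word.strip_prefix('\'')?.strip_suffix('\'')
}
```

Applied to the command quoted from set.mm, `definition 'dfbi1' for 'wb';`, this returns the pair `("dfbi1", "wb")`; the neighbouring `equality` command is rejected.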

digama0 · 2023-06-13T15:26:50Z

At least df-cleq can clearly be linked with the wceq syntax, and df-clel with wcel, why do they need to be declared primitives?

The answer to this is basically given by the same tool that is producing the output above. Being 'linked' to an axiom that does not match the requirements for a definition is not sufficient. In fact, the ax/df check is done early, before we have even done most of the fancy work on definitions: df-cleq fails immediately for failing the test "it introduces exactly one new symbol and is the first axiom to refer to that symbol". wceq has been used for hundreds of prior theorems before df-cleq comes along so it is obviously ineligible. The story is the same for df-clel.
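The quoted test can be modelled in a few lines. This sketch covers only that one test and ignores primitive declarations and everything that comes after it in the checker; the `Stmt` struct and the function name are hypothetical, and each statement is reduced to the list of syntax constructors it mentions.

```rust
use std::collections::HashSet;

/// A statement reduced to what the test needs (hypothetical representation).
pub struct Stmt<'a> {
    pub label: &'a str,
    pub is_axiom: bool,
    /// Labels of the syntax constructors occurring in the statement.
    pub symbols: Vec<&'a str>,
}

/// For each axiom, in order, report the single symbol it is the first to use,
/// or None if it introduces zero or several new symbols.
pub fn definition_candidates<'a>(stmts: &[Stmt<'a>]) -> Vec<(&'a str, Option<&'a str>)> {
    let mut seen: HashSet<&'a str> = HashSet::new();
    let mut out = Vec::new();
    for s in stmts {
        // Symbols that no earlier statement (axiom or theorem) has used.
        let mut fresh: Vec<&'a str> = Vec::new();
        for &sym in &s.symbols {
            if !seen.contains(sym) && !fresh.contains(&sym) {
                fresh.push(sym);
            }
        }
        if s.is_axiom {
            let candidate = if fresh.len() == 1 { Some(fresh[0]) } else { None };
            out.push((s.label, candidate));
        }
        seen.extend(fresh);
    }
    out
}
```

In this model, an axiom such as df-cleq that arrives after wceq has already been used finds no fresh symbol and is rejected immediately, whereas an axiom that is the first to mention exactly one constructor is accepted as a candidate definition for it.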

Interestingly a definition command appears in set.mm, even though the metamath-knife def checker does not use it:

This was part of an alternative design for the definition justification checker, which would parse the definition theorem to supply the definition body, and then ensure that the justification matches. In the design implemented here, the justification is unified against the definitional axiom to obtain the LHS -> RHS mapping, so the definition is not needed. We can validate it if you think it is worth it though.

jkingdon · 2023-06-20T06:09:49Z

Options now are to add some kind of override, or change the names.

I would be a bit disappointed if we get stuck on which of these two to choose and end up not getting the definition verifier merged.

Based on my limited understanding, I guess an override is what we have been doing (maybe expressed in words, maybe in the mmj2 definition checker), and a name change is seemingly more intellectually honest in light of things like https://us.metamath.org/mpeuni/bj-ax8.html .

Disclaimer: although I have looked at things like https://math.stackexchange.com/questions/231087/what-can-i-do-with-proper-classes and https://en.wikipedia.org/wiki/Zermelo%E2%80%93Fraenkel_set_theory#Virtual_classes, I'm not sure I understand this fully. I do gather that df-clab, df-clel, and df-cleq do not give us the ability to (say) quantify over classes, so in that sense the classes are still virtual. And I suppose one way out might be separate notations for = and e. for sets versus classes (like we already have for [ versus [.). I don't know how much of the problem this would solve, but I do suspect it would be inconvenient, or at least require changing a lot of proofs.

I'm not aware of any quick fix other than "some kind of override, or change the names" so we probably are left with those two choices for now regardless of what we do longer term.

digama0 · 2023-06-20T06:54:33Z

Quick comment, this PR isn't blocked on that issue (alone), I have some local work which continues the main implementation which got pushed off of my "to do immediately" list by something else, but which I plan to return to soon. I did prepare a version which just stubs out the missing part of the checker so that it could be merged, but as expected this leaves about 1300 errors, so it won't be immediately usable.

jkingdon · 2023-06-20T15:24:29Z

Quick comment, this PR isn't blocked on that issue (alone)

Thanks for the clarification. I'm mostly thinking of trying to coordinate:

1. The definition work (however much of it we are able/interested in merging soon)
2. Splitting into two or more crates, and
3. Adding the checker for $j usage

Not that we have a detailed plan for exactly who does all of this and how, but those seem like three of the most wanted enhancements that I've noticed.

digama0 · 2023-06-20T23:20:48Z

I'd prefer to wait on (2) until most of the in-flight additions have been merged. For (3) I think it can be worked on independently of this issue, they are not really conflicting.

jkingdon · 2023-06-23T06:01:28Z

I'd prefer to wait on (2) until most of the in-flight additions have been merged. For (3) I think it can be worked on independently of this issue, they are not really conflicting.

Makes sense. Looks like (2) is being discussed at #117 and (3) has a pull request at #118.

tirix · 2023-11-27T11:38:34Z

I'm in favor of merging this first, and handling the remaining questions over df-clab, df-clel and df-cleq separately.

If this is merged, we could already implement a metamath-knife based definition checker in set.mm's CI by just ignoring the 3 exceptions, until they are resolved.
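A CI step along these lines could filter the checker's output against a short allowlist. The sketch below is only an assumption about how that might look: the `Diagnostic` struct and the `unexpected` function are hypothetical and do not reflect metamath-knife's actual diagnostic types.

```rust
/// A reported problem, reduced to the statement label and a message
/// (hypothetical shape, not metamath-knife's diagnostic type).
#[derive(Debug, PartialEq)]
pub struct Diagnostic {
    pub label: String,
    pub message: String,
}

/// Keep only the diagnostics whose statement is not on the allowlist.
pub fn unexpected<'a>(diags: &'a [Diagnostic], allowed: &[&str]) -> Vec<&'a Diagnostic> {
    diags
        .iter()
        .filter(|d| !allowed.contains(&d.label.as_str()))
        .collect()
}
```

CI would then fail only when `unexpected(&diags, &["df-clab", "df-cleq", "df-clel"])` is non-empty, so the three known exceptions stay visible in the checker output without blocking merges.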

jkingdon · 2023-11-27T16:06:15Z

I'm in favor of merging this first, and handling the remaining questions over df-clab, df-clel and df-cleq separately.

Just to confirm something I think I know the answer to: is the definition checker complete (assuming that it isn't supposed to handle those three)? That is, does it implement the rules at "Additional rules for definitions" in https://us.metamath.org/mpeuni/conventions.html ?

If this is merged, we could already implement a metamath-knife based definition checker in set.mm's CI by just ignoring the 3 exceptions, until they are resolved.

I mean, this is what we do for mmj2, right? If so, it is hard to see merging this, and adding to CI, as a step backward.

digama0 · 2023-11-27T23:52:10Z

Since I have had a WIP commit sitting in my local copy for too long, I've pushed it to tirix/metamath-knife@verify_definitions...metamath:metamath-knife:verify_definitions_2 .

tirix · 2023-11-28T08:47:06Z

So @digama0, would you like me to create a pull request with those changes, review them, and merge them to this branch?

tirix added 5 commits June 4, 2023 20:06

Add function to export theorem dependencies in GraphML format

66383c8

Remove unwanted file!

efd0db3

Fix typo

2d388c2

Skipping syntactic dependencies

43b89d5

Definition check and definition graph export

414fdd7

tirix mentioned this pull request Jun 8, 2023

--export-graphml-deps error despite PR request for this feature approved #113

Closed

tirix added 2 commits June 8, 2023 10:42

Clippy and more one more check

293fd6e

Fmt

5facc9d

Error handling

91ed49f

tirix requested a review from digama0 June 8, 2023 22:48

tirix marked this pull request as ready for review June 8, 2023 22:48

tirix added 2 commits June 9, 2023 01:16

Special malformed definition case

bc0cd3a

Clppy

4212498

organizing the parts that have been done so far

7fdcbab

digama0 force-pushed the verify_definitions branch from 861ec39 to 7fdcbab Compare June 10, 2023 23:17

tirix commented Jun 11, 2023

src/defck.rs

tirix mentioned this pull request Jun 11, 2023

Add function to export theorem dependencies in GraphML format #112

Merged

tirix and others added 3 commits June 11, 2023 20:42

Remove unused diagnostic type

1ce90bc

refactoring

b0e7f20

check that statements don't use pending defs

4005a02

digama0 mentioned this pull request Jun 12, 2023

fix definition bugs metamath/set.mm#3247

Merged

check refl, symm, trans

7e64ac3

tirix commented Jun 12, 2023

src/defck.rs

tirix commented Jun 12, 2023

src/diag.rs

tirix commented Jun 12, 2023

src/diag.rs

digama0 reviewed Jun 12, 2023


digama0 added 5 commits June 12, 2023 20:37

DV condition for justifications

43c4f60

parameter DV check

5823b9c

dummy DV check

2cd7866

check for definiendum on RHS

c685f4f

misnamed axiom/def warnings

57a8bca

GinoGiotto mentioned this pull request Jun 13, 2023

Prove tz7.48lem without ax-8 metamath/set.mm#3199

Closed

tirix and others added 2 commits June 22, 2023 09:35

Merge branch 'main' into verify_definitions

aa1eae3

Merge branch 'main' into verify_definitions

d21c13b

tirix added 2 commits July 2, 2023 22:38

Merge branch 'main' into verify_definitions

1b77b7a

Merge branch 'main' into verify_definitions

aa4cafc

tirix mentioned this pull request Nov 27, 2023

Split into library and binary #149

Merged

Fix merge

7011aef

Merge branch 'main' into verify_definitions

80490fe

Merge branch 'main' into verify_definitions

ca1e4af
